Broccoli: Semantic Full-Text Search at your Fingertips
نویسندگان
چکیده
We present Broccoli, a fast and easy-to-use search engine for what we call semantic full-text search. Semantic full-text search combines the capabilities of standard full-text search and ontology search. The search operates on four kinds of objects: ordinary words (e.g. edible), classes (e.g. plants), instances (e.g. Broccoli), and relations (e.g. occurs-with or native-to). Queries are trees, where nodes are arbitrary bags of these objects, and arcs are relations. The user interface guides the user in incrementally constructing such trees by instant (search-as-you-type) suggestions of words, classes, instances, or relations that lead to good hits. Both standard full-text search and pure ontology search are included as special cases. In this paper, we describe the query language of Broccoli, a new kind of index that enables fast processing of queries from that language as well as fast query suggestion, the natural language processing required, and the user interface. We evaluated query times and result quality on the full version of the English Wikipedia (32 GB XML dump) combined with the YAGO ontology (26 million facts). We have implemented a fully-functional prototype based on our ideas, see http://broccoli.informatik.uni-freiburg.de.
منابع مشابه
Broccoli and Helicobacter Pylori: A Systematic Review
Objective Pharmacological treatment of Helicobacter pylori (H. pylori) infection is based on the use of at least two antibiotics with a double dose of proton pump inhibitor which results in antibiotic resistance. Anti-helicobacterial activity of sulforaphane-rich broccoli has been evaluated in laboratory studies. This study aimed to systematically review the conducted randomized clinical trials...
متن کاملThe ISMIR Cloud: A Decade of ISMIR Conferences at Your Fingertips
In this paper, we analyze the proceedings of the past International Symposia on Music Information Retrieval (ISMIR). We extract meaningful term sets from the accepted submissions and apply term weighting and Web-based filtering techniques to distill information about the topics covered by the papers. This enables us to visualize and interpret the change of hot ISMIR topics in the course of time...
متن کاملبررسی کاربرد فناوری معنایی برای سازماندهی اطلاعات در نرمافزارهای کتابخانه دیجیتالی
The present study was an attempt to investigate the use of semantic technologies to organize information in digital library software systems. The present study was a practical one which employed a descriptive survey method. The study sample consisted of three digital library software systems entitled Pars Azarakhsh, Parvan Pajoh, and Payam Mashregh. Data were collected through a checklist incl...
متن کاملAn Evaluation and Comparison of Current Peer-to-Peer Full-Text Keyword Search Techniques
Current peer-to-peer (p2p) full-text keyword search techniques fall into the following categories: document-based partitioning, keyword-based partitioning, hybrid indexing, and semantic search. This paper provides a performance evaluation and comparison of these p2p full-text keyword search techniques on a dataset with 3.7 million web pages and 6.8 million search queries. Our evaluation results...
متن کاملKeywords and RDF Fragments: Integrating Metadata and Full-Text Search in Beagle++
Full-text search engines and metadata repositories have so far investigated very different approaches to search, mainly due to their separate and different storage systems for information and data. As we have argued in previous papers, though, integrating full-text and metadata search capabilities is crucial for powerful semantic desktop search systems [3]. Semantic metadata is able to represen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1207.2615 شماره
صفحات -
تاریخ انتشار 2012